Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 35549 |
| Missing cells | 463770 |
| Missing cells (%) | 45.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.9 MiB |
| Average record size in memory | 232.0 B |
Variable types
| CAT | 16 |
|---|---|
| NUM | 12 |
| UNSUPPORTED | 1 |
Reproduction
| Analysis started | 2020-08-23 18:45:30.474993 |
|---|---|
| Analysis finished | 2020-08-23 18:45:59.594990 |
| Duration | 29.12 seconds |
| Software version | pandas-profiling v2.9.0rc1 |
| Download configuration | config.yaml |
tag has a high cardinality: 9358 distinct values | High cardinality |
ltag has a high cardinality: 971 distinct values | High cardinality |
yr is highly correlated with recordID and 1 other fields | High correlation |
recordID is highly correlated with yr and 1 other fields | High correlation |
period is highly correlated with recordID and 1 other fields | High correlation |
note1 has 31957 (89.9%) missing values | Missing |
species has 2015 (5.7%) missing values | Missing |
sex has 2506 (7.0%) missing values | Missing |
age has 20103 (56.6%) missing values | Missing |
reprod has 33898 (95.4%) missing values | Missing |
testes has 25857 (72.7%) missing values | Missing |
vagina has 33952 (95.5%) missing values | Missing |
pregnant has 34327 (96.6%) missing values | Missing |
nipples has 30521 (85.9%) missing values | Missing |
lactation has 35423 (99.6%) missing values | Missing |
hfl has 4111 (11.6%) missing values | Missing |
wgt has 3266 (9.2%) missing values | Missing |
tag has 2324 (6.5%) missing values | Missing |
note2 has 30965 (87.1%) missing values | Missing |
ltag has 1901 (5.3%) missing values | Missing |
note3 has 35533 (> 99.9%) missing values | Missing |
prevrt has 1780 (5.0%) missing values | Missing |
prevlet has 2071 (5.8%) missing values | Missing |
nestdir has 33718 (94.8%) missing values | Missing |
neststk has 30113 (84.7%) missing values | Missing |
note4 has 34908 (98.2%) missing values | Missing |
note5 has 32451 (91.3%) missing values | Missing |
prevlet is highly skewed (γ1 = 56.0844) | Skewed |
recordID has unique values | Unique |
prevrt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
prevlet has 33447 (94.1%) zeros | Zeros |
neststk has 3055 (8.6%) zeros | Zeros |
| Distinct count | 35549 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17775 |
|---|---|
| Minimum | 1 |
| Maximum | 35549 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1778.4 |
| Q1 | 8888 |
| median | 17775 |
| Q3 | 26662 |
| 95-th percentile | 33771.6 |
| Maximum | 35549 |
| Range | 35548 |
| Interquartile range (IQR) | 17774 |
Descriptive statistics
| Standard deviation | 10262.3 |
|---|---|
| Coefficient of variation (CV) | 0.577342 |
| Kurtosis | -1.2 |
| Mean | 17775 |
| Median Absolute Deviation (MAD) | 8887 |
| Skewness | 0 |
| Sum | 6.31883e+08 |
| Variance | 1.05314e+08 |
| Monotocity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 21824 | 1 | < 0.1% | |
| 25926 | 1 | < 0.1% | |
| 32069 | 1 | < 0.1% | |
| 30020 | 1 | < 0.1% | |
| 19779 | 1 | < 0.1% | |
| 17730 | 1 | < 0.1% | |
| 23873 | 1 | < 0.1% | |
| 34106 | 1 | < 0.1% | |
| 5416 | 1 | < 0.1% | |
| Other values (35539) | 35539 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 35549 | 1 | < 0.1% | |
| 35548 | 1 | < 0.1% | |
| 35547 | 1 | < 0.1% | |
| 35546 | 1 | < 0.1% | |
| 35545 | 1 | < 0.1% |
mo
Real number (ℝ≥0)
| Distinct count | 12 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.47402 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.39658 |
|---|---|
| Coefficient of variation (CV) | 0.524648 |
| Kurtosis | -1.20493 |
| Mean | 6.47402 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.0514846 |
| Sum | 230145 |
| Variance | 11.5368 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) | |
| 7 | 3633 | 10.2% | |
| 4 | 3443 | 9.7% | |
| 3 | 3390 | 9.5% | |
| 5 | 3073 | 8.6% | |
| 10 | 3064 | 8.6% | |
| 11 | 3016 | 8.5% | |
| 12 | 2799 | 7.9% | |
| 2 | 2796 | 7.9% | |
| 9 | 2751 | 7.7% | |
| 6 | 2697 | 7.6% | |
| Other values (2) | 4887 | 13.7% |
| Value | Count | Frequency (%) | |
| 1 | 2518 | 7.1% | |
| 2 | 2796 | 7.9% | |
| 3 | 3390 | 9.5% | |
| 4 | 3443 | 9.7% | |
| 5 | 3073 | 8.6% |
| Value | Count | Frequency (%) | |
| 12 | 2799 | 7.9% | |
| 11 | 3016 | 8.5% | |
| 10 | 3064 | 8.6% | |
| 9 | 2751 | 7.7% | |
| 8 | 2369 | 6.7% |
dy
Real number (ℝ≥0)
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.106 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 9 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.25669 |
|---|---|
| Coefficient of variation (CV) | 0.512648 |
| Kurtosis | -1.06417 |
| Mean | 16.106 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.0180591 |
| Sum | 572551 |
| Variance | 68.1729 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) | |
| 16 | 1806 | 5.1% | |
| 22 | 1778 | 5.0% | |
| 15 | 1716 | 4.8% | |
| 9 | 1706 | 4.8% | |
| 13 | 1516 | 4.3% | |
| 14 | 1474 | 4.1% | |
| 21 | 1455 | 4.1% | |
| 25 | 1452 | 4.1% | |
| 4 | 1350 | 3.8% | |
| 24 | 1296 | 3.6% | |
| Other values (21) | 20000 | 56.3% |
| Value | Count | Frequency (%) | |
| 1 | 738 | 2.1% | |
| 2 | 593 | 1.7% | |
| 3 | 746 | 2.1% | |
| 4 | 1350 | 3.8% | |
| 5 | 1210 | 3.4% |
| Value | Count | Frequency (%) | |
| 31 | 684 | 1.9% | |
| 30 | 1020 | 2.9% | |
| 29 | 1153 | 3.2% | |
| 28 | 903 | 2.5% | |
| 27 | 702 | 2.0% |
| Distinct count | 26 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1990.48 |
|---|---|
| Minimum | 1977 |
| Maximum | 2002 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 1977 |
|---|---|
| 5-th percentile | 1979 |
| Q1 | 1984 |
| median | 1990 |
| Q3 | 1997 |
| 95-th percentile | 2002 |
| Maximum | 2002 |
| Range | 25 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 7.49336 |
|---|---|
| Coefficient of variation (CV) | 0.00376461 |
| Kurtosis | -1.28477 |
| Mean | 1990.48 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.0441459 |
| Sum | 7.07594e+07 |
| Variance | 56.1504 |
| Monotocity | Increasing |
Histogram with fixed size bins (bins=26)
| Value | Count | Frequency (%) | |
| 1997 | 2493 | 7.0% | |
| 2002 | 2229 | 6.3% | |
| 1982 | 1978 | 5.6% | |
| 1996 | 1706 | 4.8% | |
| 1983 | 1673 | 4.7% | |
| 1987 | 1671 | 4.7% | |
| 2001 | 1610 | 4.5% | |
| 1998 | 1610 | 4.5% | |
| 1989 | 1569 | 4.4% | |
| 2000 | 1552 | 4.4% | |
| Other values (16) | 17458 | 49.1% |
| Value | Count | Frequency (%) | |
| 1977 | 503 | 1.4% | |
| 1978 | 1048 | 2.9% | |
| 1979 | 719 | 2.0% | |
| 1980 | 1415 | 4.0% | |
| 1981 | 1472 | 4.1% |
| Value | Count | Frequency (%) | |
| 2002 | 2229 | 6.3% | |
| 2001 | 1610 | 4.5% | |
| 2000 | 1552 | 4.4% | |
| 1999 | 1135 | 3.2% | |
| 1998 | 1610 | 4.5% |
| Distinct count | 322 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149.534 |
|---|---|
| Minimum | -284 |
| Maximum | 295 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | -284 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 73 |
| median | 149 |
| Q3 | 234 |
| 95-th percentile | 287 |
| Maximum | 295 |
| Range | 579 |
| Interquartile range (IQR) | 161 |
Descriptive statistics
| Standard deviation | 97.0927 |
|---|---|
| Coefficient of variation (CV) | 0.649301 |
| Kurtosis | -0.313261 |
| Mean | 149.534 |
| Median Absolute Deviation (MAD) | 82 |
| Skewness | -0.408557 |
| Sum | 5.31579e+06 |
| Variance | 9427 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| -44.5 | 342 | 1.0% | |
| 233 | 312 | 0.9% | |
| -62 | 306 | 0.9% | |
| 232 | 290 | 0.8% | |
| 230 | 267 | 0.8% | |
| 290 | 260 | 0.7% | |
| 231 | 258 | 0.7% | |
| 229 | 256 | 0.7% | |
| 288 | 243 | 0.7% | |
| 228 | 238 | 0.7% | |
| Other values (312) | 32777 | 92.2% |
| Value | Count | Frequency (%) | |
| -284 | 12 | < 0.1% | |
| -283 | 12 | < 0.1% | |
| -278 | 12 | < 0.1% | |
| -277 | 12 | < 0.1% | |
| -267 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 295 | 164 | 0.5% | |
| 294 | 197 | 0.6% | |
| 293 | 218 | 0.6% | |
| 292 | 213 | 0.6% | |
| 291 | 88 | 0.2% |
plot
Real number (ℝ≥0)
| Distinct count | 24 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.397 |
|---|---|
| Minimum | 1 |
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 11 |
| Q3 | 17 |
| 95-th percentile | 22 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.79941 |
|---|---|
| Coefficient of variation (CV) | 0.596596 |
| Kurtosis | -1.14782 |
| Mean | 11.397 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.121433 |
| Sum | 405152 |
| Variance | 46.2319 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) | |
| 12 | 2365 | 6.7% | |
| 2 | 2194 | 6.2% | |
| 17 | 2039 | 5.7% | |
| 1 | 1995 | 5.6% | |
| 4 | 1969 | 5.5% | |
| 9 | 1936 | 5.4% | |
| 11 | 1918 | 5.4% | |
| 8 | 1891 | 5.3% | |
| 14 | 1885 | 5.3% | |
| 3 | 1828 | 5.1% | |
| Other values (14) | 15529 | 43.7% |
| Value | Count | Frequency (%) | |
| 1 | 1995 | 5.6% | |
| 2 | 2194 | 6.2% | |
| 3 | 1828 | 5.1% | |
| 4 | 1969 | 5.5% | |
| 5 | 1194 | 3.4% |
| Value | Count | Frequency (%) | |
| 24 | 1048 | 2.9% | |
| 23 | 571 | 1.6% | |
| 22 | 1399 | 3.9% | |
| 21 | 1173 | 3.3% | |
| 20 | 1390 | 3.9% |
| Distinct count | 11 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 31957 |
| Missing (%) | 89.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3221 |
|---|---|
| Minimum | 1 |
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 6 |
| Q3 | 13 |
| 95-th percentile | 13 |
| Maximum | 13 |
| Range | 12 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.21671 |
|---|---|
| Coefficient of variation (CV) | 0.575888 |
| Kurtosis | -1.41534 |
| Mean | 7.3221 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.237883 |
| Sum | 26301 |
| Variance | 17.7807 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) | |
| 13 | 1068 | 3.0% | |
| 5 | 858 | 2.4% | |
| 2 | 688 | 1.9% | |
| 9 | 427 | 1.2% | |
| 6 | 376 | 1.1% | |
| 4 | 70 | 0.2% | |
| 1 | 58 | 0.2% | |
| 8 | 28 | 0.1% | |
| 3 | 15 | < 0.1% | |
| 11 | 3 | < 0.1% | |
| (Missing) | 31957 | 89.9% |
| Value | Count | Frequency (%) | |
| 1 | 58 | 0.2% | |
| 2 | 688 | 1.9% | |
| 3 | 15 | < 0.1% | |
| 4 | 70 | 0.2% | |
| 5 | 858 | 2.4% |
| Value | Count | Frequency (%) | |
| 13 | 1068 | 3.0% | |
| 12 | 1 | < 0.1% | |
| 11 | 3 | < 0.1% | |
| 9 | 427 | 1.2% | |
| 8 | 28 | 0.1% |
stake
Real number (ℝ)
| Distinct count | 80 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 70 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.8041 |
|---|---|
| Minimum | -99 |
| Maximum | 99 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | -99 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 25 |
| median | 45 |
| Q3 | 64 |
| 95-th percentile | 77 |
| Maximum | 99 |
| Range | 198 |
| Interquartile range (IQR) | 39 |
Descriptive statistics
| Standard deviation | 23.4533 |
|---|---|
| Coefficient of variation (CV) | 0.523463 |
| Kurtosis | 2.41726 |
| Mean | 44.8041 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.4425 |
| Sum | 1.58961e+06 |
| Variance | 550.059 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 11 | 933 | 2.6% | |
| 77 | 925 | 2.6% | |
| 17 | 886 | 2.5% | |
| 71 | 859 | 2.4% | |
| 37 | 832 | 2.3% | |
| 76 | 830 | 2.3% | |
| 16 | 821 | 2.3% | |
| 72 | 812 | 2.3% | |
| 61 | 806 | 2.3% | |
| 21 | 801 | 2.3% | |
| Other values (70) | 26974 | 75.9% |
| Value | Count | Frequency (%) | |
| -99 | 94 | 0.3% | |
| 0 | 8 | < 0.1% | |
| 1 | 36 | 0.1% | |
| 2 | 2 | < 0.1% | |
| 3 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99 | 693 | 1.9% | |
| 88 | 19 | 0.1% | |
| 87 | 25 | 0.1% | |
| 84 | 35 | 0.1% | |
| 82 | 2 | < 0.1% |
| Distinct count | 47 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 2015 |
| Missing (%) | 5.7% |
| Memory size | 277.7 KiB |
| DM | |
|---|---|
| PP | |
| DO | |
| PB | |
| RM | |
| Other values (42) |
| Value | Count | Frequency (%) | |
| DM | 10596 | 29.8% | |
| PP | 3123 | 8.8% | |
| DO | 3027 | 8.5% | |
| PB | 2891 | 8.1% | |
| RM | 2609 | 7.3% | |
| DS | 2504 | 7.0% | |
| OT | 2249 | 6.3% | |
| PF | 1597 | 4.5% | |
| PE | 1299 | 3.7% | |
| OL | 1006 | 2.8% | |
| Other values (37) | 2633 | 7.4% | |
| (Missing) | 2015 | 5.7% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.05668 |
| Min length | 2 |
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 2506 |
| Missing (%) | 7.0% |
| Memory size | 277.7 KiB |
| M | |
|---|---|
| F | |
| R | 3 |
| Z | 1 |
| P | 1 |
| Value | Count | Frequency (%) | |
| M | 17348 | 48.8% | |
| F | 15690 | 44.1% | |
| R | 3 | < 0.1% | |
| Z | 1 | < 0.1% | |
| P | 1 | < 0.1% | |
| (Missing) | 2506 | 7.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.14099 |
| Min length | 1 |
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 20103 |
| Missing (%) | 56.6% |
| Memory size | 277.7 KiB |
| Z | |
|---|---|
| J | 4 |
| ZJ | 1 |
| Value | Count | Frequency (%) | |
| Z | 15441 | 43.4% | |
| J | 4 | < 0.1% | |
| ZJ | 1 | < 0.1% | |
| (Missing) | 20103 | 56.6% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.13103 |
| Min length | 1 |
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 33898 |
| Missing (%) | 95.4% |
| Memory size | 277.7 KiB |
| J | |
|---|---|
| S | 3 |
| R | 3 |
| M | 1 |
| Value | Count | Frequency (%) | |
| J | 1644 | 4.6% | |
| S | 3 | < 0.1% | |
| R | 3 | < 0.1% | |
| M | 1 | < 0.1% | |
| (Missing) | 33898 | 95.4% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.90711 |
| Min length | 1 |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 25857 |
| Missing (%) | 72.7% |
| Memory size | 277.7 KiB |
| S | |
|---|---|
| R | |
| M | 709 |
| E | 2 |
| Z | 1 |
| Value | Count | Frequency (%) | |
| S | 7530 | 21.2% | |
| R | 1449 | 4.1% | |
| M | 709 | 2.0% | |
| E | 2 | < 0.1% | |
| Z | 1 | < 0.1% | |
| J | 1 | < 0.1% | |
| (Missing) | 25857 | 72.7% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.45472 |
| Min length | 1 |
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 33952 |
| Missing (%) | 95.5% |
| Memory size | 277.7 KiB |
| S | |
|---|---|
| P | 120 |
| B | 50 |
| E | 3 |
| R | 2 |
| Other values (2) | 2 |
| Value | Count | Frequency (%) | |
| S | 1420 | 4.0% | |
| P | 120 | 0.3% | |
| B | 50 | 0.1% | |
| E | 3 | < 0.1% | |
| R | 2 | < 0.1% | |
| Z | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| (Missing) | 33952 | 95.5% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.91015 |
| Min length | 1 |
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 34327 |
| Missing (%) | 96.6% |
| Memory size | 277.7 KiB |
| P | |
|---|---|
| Q | 36 |
| E | 5 |
| S | 1 |
| L | 1 |
| Value | Count | Frequency (%) | |
| P | 1179 | 3.3% | |
| Q | 36 | 0.1% | |
| E | 5 | < 0.1% | |
| S | 1 | < 0.1% | |
| L | 1 | < 0.1% | |
| (Missing) | 34327 | 96.6% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.93125 |
| Min length | 1 |
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 30521 |
| Missing (%) | 85.9% |
| Memory size | 277.7 KiB |
| E | |
|---|---|
| B | |
| R | 252 |
| S | 10 |
| P | 3 |
| Value | Count | Frequency (%) | |
| E | 3988 | 11.2% | |
| B | 775 | 2.2% | |
| R | 252 | 0.7% | |
| S | 10 | < 0.1% | |
| P | 3 | < 0.1% | |
| (Missing) | 30521 | 85.9% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.71712 |
| Min length | 1 |
| Distinct count | 4 |
|---|---|
| Unique (%) | 3.2% |
| Missing | 35423 |
| Missing (%) | 99.6% |
| Memory size | 277.7 KiB |
| L | |
|---|---|
| E | 3 |
| S | 1 |
| B | 1 |
| Value | Count | Frequency (%) | |
| L | 121 | 0.3% | |
| E | 3 | < 0.1% | |
| S | 1 | < 0.1% | |
| B | 1 | < 0.1% | |
| (Missing) | 35423 | 99.6% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.99291 |
| Min length | 1 |
| Distinct count | 56 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 4111 |
| Missing (%) | 11.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.2879 |
|---|---|
| Minimum | 2 |
| Maximum | 70 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 21 |
| median | 32 |
| Q3 | 36 |
| 95-th percentile | 49 |
| Maximum | 70 |
| Range | 68 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 9.56476 |
|---|---|
| Coefficient of variation (CV) | 0.326577 |
| Kurtosis | -0.606196 |
| Mean | 29.2879 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.317434 |
| Sum | 920754 |
| Variance | 91.4846 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 36 | 4071 | 11.5% | |
| 37 | 3277 | 9.2% | |
| 35 | 2649 | 7.5% | |
| 21 | 2504 | 7.0% | |
| 20 | 2304 | 6.5% | |
| 22 | 1704 | 4.8% | |
| 16 | 1538 | 4.3% | |
| 26 | 1359 | 3.8% | |
| 34 | 1358 | 3.8% | |
| 17 | 1285 | 3.6% | |
| Other values (46) | 9389 | 26.4% | |
| (Missing) | 4111 | 11.6% |
| Value | Count | Frequency (%) | |
| 2 | 1 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 8 | 3 | < 0.1% | |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 70 | 1 | < 0.1% | |
| 64 | 1 | < 0.1% | |
| 58 | 2 | < 0.1% | |
| 57 | 2 | < 0.1% | |
| 56 | 1 | < 0.1% |
| Distinct count | 255 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 3266 |
| Missing (%) | 9.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.6724 |
|---|---|
| Minimum | 4 |
| Maximum | 280 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 20 |
| median | 37 |
| Q3 | 48 |
| 95-th percentile | 132 |
| Maximum | 280 |
| Range | 276 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 36.6313 |
|---|---|
| Coefficient of variation (CV) | 0.858429 |
| Kurtosis | 6.13865 |
| Mean | 42.6724 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 2.33109 |
| Sum | 1.37759e+06 |
| Variance | 1341.85 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 46 | 892 | 2.5% | |
| 44 | 879 | 2.5% | |
| 45 | 850 | 2.4% | |
| 43 | 839 | 2.4% | |
| 47 | 806 | 2.3% | |
| 42 | 803 | 2.3% | |
| 10 | 759 | 2.1% | |
| 48 | 749 | 2.1% | |
| 9 | 742 | 2.1% | |
| 49 | 739 | 2.1% | |
| Other values (245) | 24225 | 68.1% | |
| (Missing) | 3266 | 9.2% |
| Value | Count | Frequency (%) | |
| 4 | 17 | < 0.1% | |
| 5 | 58 | 0.2% | |
| 6 | 186 | 0.5% | |
| 7 | 552 | 1.6% | |
| 8 | 667 | 1.9% |
| Value | Count | Frequency (%) | |
| 280 | 1 | < 0.1% | |
| 278 | 1 | < 0.1% | |
| 275 | 1 | < 0.1% | |
| 274 | 1 | < 0.1% | |
| 270 | 1 | < 0.1% |
| Distinct count | 9358 |
|---|---|
| Unique (%) | 28.2% |
| Missing | 2324 |
| Missing (%) | 6.5% |
| Memory size | 277.7 KiB |
| 0 | 1574 |
|---|---|
| -1 | 117 |
| 110 | 43 |
| 4541 | 40 |
| 7470 | 39 |
| Other values (9353) |
| Value | Count | Frequency (%) | |
| 0 | 1574 | 4.4% | |
| -1 | 117 | 0.3% | |
| 110 | 43 | 0.1% | |
| 4541 | 40 | 0.1% | |
| 7470 | 39 | 0.1% | |
| 6660 | 38 | 0.1% | |
| 8613 | 37 | 0.1% | |
| 5588 | 36 | 0.1% | |
| 8100PB | 35 | 0.1% | |
| 4645 | 35 | 0.1% | |
| Other values (9348) | 31231 | 87.9% | |
| (Missing) | 2324 | 6.5% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.29194 |
| Min length | 1 |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 30965 |
| Missing (%) | 87.1% |
| Memory size | 277.7 KiB |
| * | |
|---|---|
| 0 | 2 |
| Value | Count | Frequency (%) | |
| * | 4582 | 12.9% | |
| 0 | 2 | < 0.1% | |
| (Missing) | 30965 | 87.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.7421 |
| Min length | 1 |
| Distinct count | 971 |
|---|---|
| Unique (%) | 2.9% |
| Missing | 1901 |
| Missing (%) | 5.3% |
| Memory size | 277.7 KiB |
| 0 | |
|---|---|
| 6998 | 20 |
| 7051 | 18 |
| 6723 | 17 |
| 8587 | 17 |
| Other values (966) | 1795 |
| Value | Count | Frequency (%) | |
| 0 | 31781 | 89.4% | |
| 6998 | 20 | 0.1% | |
| 7051 | 18 | 0.1% | |
| 6723 | 17 | < 0.1% | |
| 8587 | 17 | < 0.1% | |
| 7630 | 17 | < 0.1% | |
| 5703 | 15 | < 0.1% | |
| 7260 | 15 | < 0.1% | |
| 6656 | 14 | < 0.1% | |
| 6726 | 14 | < 0.1% | |
| Other values (961) | 1720 | 4.8% | |
| (Missing) | 1901 | 5.3% |
Length
| Max length | 6 |
|---|---|
| Median length | 1 |
| Mean length | 1.26355 |
| Min length | 1 |
| Distinct count | 1 |
|---|---|
| Unique (%) | 6.2% |
| Missing | 35533 |
| Missing (%) | > 99.9% |
| Memory size | 277.7 KiB |
| * |
|---|
| Value | Count | Frequency (%) | |
| * | 16 | < 0.1% | |
| (Missing) | 35533 | > 99.9% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9991 |
| Min length | 1 |
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 2071 |
| Missing (%) | 5.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.57922 |
|---|---|
| Minimum | 0 |
| Maximum | 8000 |
| Zeros | 33447 |
| Zeros (%) | 94.1% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8000 |
| Range | 8000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 100.371 |
|---|---|
| Coefficient of variation (CV) | 38.9154 |
| Kurtosis | 3907.73 |
| Mean | 2.57922 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 56.0844 |
| Sum | 86347 |
| Variance | 10074.4 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=17)
| Value | Count | Frequency (%) | |
| 0 | 33447 | 94.1% | |
| 2137 | 5 | < 0.1% | |
| 1998 | 4 | < 0.1% | |
| 2345 | 4 | < 0.1% | |
| 2108 | 3 | < 0.1% | |
| 8000 | 3 | < 0.1% | |
| 2036 | 2 | < 0.1% | |
| 2346 | 1 | < 0.1% | |
| 2412 | 1 | < 0.1% | |
| 2045 | 1 | < 0.1% | |
| Other values (7) | 7 | < 0.1% | |
| (Missing) | 2071 | 5.8% |
| Value | Count | Frequency (%) | |
| 0 | 33447 | 94.1% | |
| 1359 | 1 | < 0.1% | |
| 1998 | 4 | < 0.1% | |
| 2017 | 1 | < 0.1% | |
| 2036 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 8000 | 3 | < 0.1% | |
| 4525 | 1 | < 0.1% | |
| 2500 | 1 | < 0.1% | |
| 2412 | 1 | < 0.1% | |
| 2346 | 1 | < 0.1% |
| Distinct count | 15 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 33718 |
| Missing (%) | 94.8% |
| Memory size | 277.7 KiB |
| SE | |
|---|---|
| E | |
| NE | |
| SW | |
| N | |
| Other values (10) |
| Value | Count | Frequency (%) | |
| SE | 387 | 1.1% | |
| E | 256 | 0.7% | |
| NE | 250 | 0.7% | |
| SW | 237 | 0.7% | |
| N | 195 | 0.5% | |
| W | 188 | 0.5% | |
| S | 179 | 0.5% | |
| NW | 131 | 0.4% | |
| AT | 2 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| Other values (5) | 5 | < 0.1% | |
| (Missing) | 33718 | 94.8% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.92546 |
| Min length | 1 |
| Distinct count | 82 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 30113 |
| Missing (%) | 84.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.7564 |
|---|---|
| Minimum | -2 |
| Maximum | 99 |
| Zeros | 3055 |
| Zeros (%) | 8.6% |
| Memory size | 277.7 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 43 |
| 95-th percentile | 73 |
| Maximum | 99 |
| Range | 101 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 26.5344 |
|---|---|
| Coefficient of variation (CV) | 1.34308 |
| Kurtosis | -0.346169 |
| Mean | 19.7564 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.0004 |
| Sum | 107396 |
| Variance | 704.075 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3055 | 8.6% | |
| 46 | 176 | 0.5% | |
| 51 | 161 | 0.5% | |
| 37 | 134 | 0.4% | |
| 71 | 132 | 0.4% | |
| 75 | 94 | 0.3% | |
| 22 | 84 | 0.2% | |
| 52 | 81 | 0.2% | |
| 14 | 80 | 0.2% | |
| 61 | 76 | 0.2% | |
| Other values (72) | 1363 | 3.8% | |
| (Missing) | 30113 | 84.7% |
| Value | Count | Frequency (%) | |
| -2 | 2 | < 0.1% | |
| -1 | 9 | < 0.1% | |
| 0 | 3055 | 8.6% | |
| 1 | 15 | < 0.1% | |
| 2 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99 | 39 | 0.1% | |
| 88 | 9 | < 0.1% | |
| 87 | 9 | < 0.1% | |
| 86 | 2 | < 0.1% | |
| 85 | 2 | < 0.1% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 34908 |
| Missing (%) | 98.2% |
| Memory size | 277.7 KiB |
| TE | |
|---|---|
| TR | |
| UT | |
| TB | 18 |
| TA | 11 |
| Value | Count | Frequency (%) | |
| TE | 427 | 1.2% | |
| TR | 124 | 0.3% | |
| UT | 55 | 0.2% | |
| TB | 18 | 0.1% | |
| TA | 11 | < 0.1% | |
| TL | 6 | < 0.1% | |
| (Missing) | 34908 | 98.2% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.98197 |
| Min length | 2 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| recordID | mo | dy | yr | period | plot | note1 | stake | species | sex | age | reprod | testes | vagina | pregnant | nipples | lactation | hfl | wgt | tag | note2 | ltag | note3 | prevrt | prevlet | nestdir | neststk | note4 | note5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 7 | 16 | 1977 | 1.0 | 2 | NaN | 16.0 | NaN | M | Z | NaN | NaN | NaN | NaN | NaN | NaN | 32.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 1 | 2 | 7 | 16 | 1977 | 1.0 | 3 | NaN | 23.0 | NaN | M | Z | NaN | NaN | NaN | NaN | NaN | NaN | 33.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 2 | 3 | 7 | 16 | 1977 | 1.0 | 2 | NaN | 25.0 | DM | F | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 37.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 3 | 4 | 7 | 16 | 1977 | 1.0 | 7 | NaN | 25.0 | DM | M | Z | NaN | NaN | NaN | NaN | NaN | NaN | 36.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 4 | 5 | 7 | 16 | 1977 | 1.0 | 3 | NaN | 26.0 | DM | M | Z | NaN | NaN | NaN | NaN | NaN | NaN | 35.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 5 | 6 | 7 | 16 | 1977 | 1.0 | 1 | NaN | 27.0 | PF | M | NaN | J | NaN | NaN | NaN | NaN | NaN | 14.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 6 | 7 | 7 | 16 | 1977 | 1.0 | 2 | NaN | 31.0 | PE | F | NaN | NaN | NaN | NaN | P | NaN | NaN | NaN | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 7 | 8 | 7 | 16 | 1977 | 1.0 | 1 | NaN | 36.0 | DM | M | NaN | NaN | S | NaN | NaN | NaN | NaN | 37.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 8 | 9 | 7 | 16 | 1977 | 1.0 | 1 | NaN | 42.0 | DM | F | Z | NaN | NaN | NaN | NaN | NaN | NaN | 34.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 9 | 10 | 7 | 16 | 1977 | 1.0 | 6 | NaN | 46.0 | PF | F | Z | NaN | NaN | NaN | NaN | NaN | NaN | 20.0 | NaN | 0 | NaN | 0 | NaN | 0 | 0.0 | NaN | 0.0 | NaN | NaN |
Last rows
| recordID | mo | dy | yr | period | plot | note1 | stake | species | sex | age | reprod | testes | vagina | pregnant | nipples | lactation | hfl | wgt | tag | note2 | ltag | note3 | prevrt | prevlet | nestdir | neststk | note4 | note5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 35539 | 35540 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 54.0 | PB | F | Z | NaN | NaN | NaN | NaN | NaN | NaN | 26.0 | 23.0 | 482738 | NaN | 0 | NaN | 0 | 0.0 | NaN | NaN | NaN | NaN |
| 35540 | 35541 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 34.0 | PB | F | NaN | NaN | NaN | NaN | NaN | R | NaN | 24.0 | 31.0 | 716537 | NaN | 0 | NaN | 0 | 0.0 | NaN | NaN | NaN | NaN |
| 35541 | 35542 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 23.0 | PB | F | Z | NaN | NaN | NaN | NaN | NaN | NaN | 26.0 | 29.0 | 0F7659 | NaN | 0 | NaN | 0 | 0.0 | NaN | NaN | NaN | NaN |
| 35542 | 35543 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 77.0 | PB | F | Z | NaN | NaN | NaN | NaN | NaN | NaN | 27.0 | 34.0 | 701178 | NaN | 0 | NaN | 0 | 0.0 | NaN | NaN | NaN | NaN |
| 35543 | 35544 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 64.0 | US | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 35544 | 35545 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 32.0 | AH | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 35545 | 35546 | 12 | 31 | 2002 | 295.0 | 15 | NaN | 45.0 | AH | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 35546 | 35547 | 12 | 31 | 2002 | 295.0 | 10 | NaN | 32.0 | RM | F | NaN | NaN | NaN | NaN | NaN | R | NaN | 15.0 | 14.0 | NaN | NaN | 0 | NaN | 0 | 0.0 | NaN | NaN | UT | NaN |
| 35547 | 35548 | 12 | 31 | 2002 | 295.0 | 7 | NaN | 13.0 | DO | M | NaN | NaN | M | NaN | NaN | NaN | NaN | 36.0 | 51.0 | NaN | NaN | 0 | NaN | 0 | 0.0 | NaN | NaN | UT | NaN |
| 35548 | 35549 | 12 | 31 | 2002 | 295.0 | 5 | 2.0 | 99.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |